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Short History of SOL 


e Stands for Structured Query Language 


e Developed at IBM by Donald D. Chamberlin and 
Raymond F. Boyce 


— [Chamberlin, Boyce: SEQUEL: A Structured English Query Language. 
SIGMOD Workshop, Vol. 1 1974: 249-264] 


e 1% commercial language for Codd's model 


— First commercial system: Oracle (v2); later came IBM 
products based on System R 
— Stronebraker's Ingres used QUEL, similar to SQL 
e Eventually converted to SQL 


standardized as ANSI (1986), ISO (1987) 


ISO Standard 


ISO/IEC CD 9075-1 


Information technology — Database languages — SOL — Part 1: Framework 
(SOL/Framework) 


GENERAL INFORMATION ° 

ÓN https: //www.iso.org/standard/76583.html 
Edition : 6 

Technical Committee : ISO/IEC JTC 1/SC 32 Data management and interchange 


ICS : 35.060 Languages used in information technology 


ANSI/ISO/IEC 9075:2003, "Database Language SQL” 
Parts 1 ("SQL/ Framework"), 2 ("SQL/Foundation"), 3 ("SQL/CLI"), 4 ("SQL/PSM'"), 


9 ("SQL/MED"), 10 ("SQL/OLB"), 11 ("SQL/Schemata"), 13 ("SQL/JRT"), 
14 ("SQL/XML') 


SOL vs. Theory 


e SOL can be viewed as a human-triendiy 
implementation of RA/RC 


e ... with several nontrivial differences: 


— Å relation is not a tuple set, rather a tuple bag or list 
e Repetitions allowed 
e Order is meaningful 
— NULL values can represent missing values 
e It is not the standard true/false logic, but rather the 3-valued 
logic (what is the meaning of NULL>5 or NULL<5’?) 
e More or less the same across DBs, yet different 
vendors provide different extensions 


IN 


CREATE TABLE Student(sid int, name varchar, year int); 
INSERT INTO Student VALUES 

(861, 'Alma', 2), 

(753, 'Amir', 1), 

(955, 'Ahuva', 2), 

(699, 'Adi', 2); 


CREATE TABLE Enroll(sid int, course varchar, grade int); 
INSERT INTO Enroll VALUES 

(861, 'DB', 79), 

(861, 'PL', 82), 


(753, 'PL', 93), SELECT name, course 
a ER, FROM Student, Enroll 
(699. pR: 95): WHERE Student.sid = Enroll.sid 


CREATE TABLE Course(name varchar, credit int); 
INSERT INTO Course VALUES 


('DB', 3), 
(PL', 2), 
(AT, 3): Try online (w/ PosgreSQL): 


e htto://salfiddle.com 
e https://saliteonline.com 


Outline 


e Introduction 
> + Basic SQL Queries 
* Aggregation and Grouping 
e Nested SQL Queries 
e NULLS 
e Views 


Basic SOL Query 


List of attributes List of relation names 


V iw ha | Uy 
Ay Ay, bi BE Condition over ( 
are all in R,,...,R,, attributes 


å 
Example: > Se a 
SELECT name, course 
FROM /Student, Enrol 
WHERE Student.sid = Enroll.sid 


S Ga aa A 
Az >tugent I Enroll 


sid name year sic! course 
— ss? Nr 


7 


Basic SQL to RA 


SELECT A,,...,A, 7 


ROM Pisa Rph 
WHERE Condition(B,,...,B,) 
Ta NK Condition(B,,...,B RX... XR, )) 
Nas =e 


Under the bag semantics! 


DISTINGT Eliminates Duplicate Tuples 


Eliminate duplicate tuples 


/ ta Å 
SELECT (DISTINCT) As, . a 


FROM R,,...,R, 
WHERE Condition(B,,...,B,) 


TTA... ALD Condition(Ba,...Bm) (R,x...X Ra) 


Under the set semantics! 


SELECT name 
FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 


Example 


Student 


sid course 


ETER 


U ST MCL 


SELECT “name 
FROM Student, 


Enroll 
Enroll.sid 


Student.sid name year Enroll.sid course 


KE: 61) De 


-xample 


Student 


Enroll 


sid course 


861 DB 
861 PL 
753 PL 


SELEGT name 
FROM Student, Enroll 
WHERE Student.si 


Student.sid name year Enroll.sid course 
861 Alma 2 


861 Alma 
753 Amir 


Example 


Student 


sid name year SELECT name 


FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 


Enroll 


sid course 


Example with DISTINGT 
Student Fe 


sid name year SELECT DISTINCT name 


FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 


Enroll 


sid course 


861 DB 
861 PL 
753 PL 


More SELECT Options 


e SQL allows for several important operations 
in the SELECT clause 


— Shorthand for selecting all attributes (*) 
— Attributes can be renamed 


— Attributes can be invented as functions of other 
attributes 


e (Later: aggregate functions) 


Example: Select All Attributes 


SELECT 
tudent, Enroll 


WHERE Student.sid = Enroll.sid 


Example: Attribute Naming (Aliasing) 


Student 


SELECT name AS tuden), cid AS course 
FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 


Example: Functions on Attributes 


Enroll 
= e 


sid course credit grade SELECT sid 
GERT) course, 
too ere 
areas) commer PS 
> FROM Miror 


Å D WHERE grade>69 


ke l Vv course 
K 
L 861 DB great 


753 PL 182 great 


hr > 
S 


Generalized Semantics 


Produce the tuple 
Fit 


A 
SELECT F,(t),...,F,(t) 
FROM Ri,...,R, & 


WHERE Condition(t) m, 


Go through 
/ every tuple t in 
Rix... XR, 


Such that 
Condition ct) holds 


Question 


Student 


With what we have so far, 
could you find the 
students who are enrolled 
in both DB and ALE 


HERE 
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Relation Naming (Aliasing) Eu E 


nome 


SELECT Student.sid, name 
FROM Student, Enrutt E Enrol P 

WHERE Student.sid = E.sid AND 

Student.sid - F.sid AND 


-course="DB' AND — 
F.course='PL' 


Alma 


Amir 
955 Ahuva 2 


What Does This Query Return? 


Student 


sid name year SELECT Student.sid, name 


FROM Student, Enroll E, Enroll F 
WHERE Student.sid = E.sid AND 
m s Student: sid = F.sid AND 


Enroll På 
course | 


LL Y 
? 


753 PL 


DISTINCT makes a difference? 
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The WHERE Clause 


e The WHERE clause (select op) allows to build 
/? arbitrary propositional logic using built-in 
predicates over attributes 

— Logical operators: AND, OR, NOT. 
Several built-in predicates: for example: 
— Comparisons on numbers/strings (lexicographic) 

( =, !=, >, <, >=, <=, between(x AND y) 

— Membership in lists: 
IN(%,...,Xx), NOT IN(x,,...,X,) 


EEE 


e (Later: EXISTS, > ANY, > ALL, IS NULL, ...) 
AN 
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Example 1 


Enroll 
sid course credit grade SELECT * 
861 80 FROM Enroll 


753 WHERE grade between(70 AND 95) AND 
course != 'PL' 
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Example 2 


Enroll 


sid course credit grade 


861 80 
753 


SELECT * 
FROM Enroll C 


WHERE course\IN |[('*PL', 
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/ 


Me 
OS! , 


'AI') 


Explicit Joins 


Create tuple combinations only 
when the condition is true 


SELECT name, course 


nroll ON (Student.sid-Enroll.sid) 


ay J 


INNER JOIN 
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Why Explicit Joins? 


An explicit join (JOIN...ON ) forces to “push” 
select conditions to the FROM clause 


lt does not add expressive power — you can have 
the ON condition in the WHERE clause 


But, it allows you to intervene in the join ordering 
and state explicitly the join condition 


Pros & CONS: 


/=CTonstraining the query plan (compared to 


implicit joins) å 
— May be way faster than chosen plan 
— May be easier to understand who joins when 


Explicit Joins Can be with WHERE Filtering 


Student 


course 


a 


SELECT name, course, grade 
FROM Student INNER JOIN Enroll 


ON. ent.sid = Enroll.sid) 
WHERE grade>80 | 


course 


grade 
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Multiple Joins 


Student SELECT Student.name, grade 


FROM Tours3 (00: 
a, 
(Student JOIN Enroll 
ON (Student.sid = Enroll.sip ON 
Se 
( R Se erse Amame ) ? 


course grade 
79 


Question 
V 


Student 


With what we have so far, 
could you find the ids of 
all persons (students and 

employees)? 
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Set Operations 


e We can apply union, intersection and 
difference to two (or more) queries 
— (0,) UNION (0: 


— (Q,) INTERSECT (Q,) 
— (0,) EXCEPT (05) 


s Q; and Q, must be union compatible in a 
weak sense 


— Same #attributes 
— The output adopts names of 15t subquery 
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Bag or Set Semantics? 


e Default is set semantics: 
1. Eliminate duplicates 
2. Apply operator 
3. Eliminate duplicates 
e For bag semantics, use the keyword ALL 
— (Q,) UNION ALL (Cp) 
— (Q,) INTERSECT ALL (O) 
— (Q,) EXCEPT ALL (Q>) 


Question Revisited 


Student 


sid name year (SELECT sid FROM Student) 


UNION 
955 Ahuva 2 


Employee 


(SELECT id FROM Employee) 
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What are the Results? 


Student (SELECT name FROM Student) 
sid name year UNION 
(SELECT name FROM Employee) 
(SELECT name FROM Student) 
UNION ALL 
Employee (SELECT name FROM Employee) 


(SELECT name FROM Student) 
UNTON ALL 
(SELECT name FROM Employee) 


) 
EXGEPT ALL 
(SELEGT name FROM Employee) 
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Tuple Order 


e Recall that the result of an SQL query is a 
list of tuples 
— But we usually ignore this order since there is 

no guarantee on any specific order 

* You can specify an order by sort keys, and 
then this order is guaranteed 
— But no guarantees on ties 

e And once we can control the order, we can 
ask for the top-k in the order 
— Simple: stop after k 


(ORDER BY) 


SELECT Aru. Ap 
FROM R,,...,R, 
WHERE Condition(B,,... 


, Bm) 


Example: 


SELECT + 
FROM Student, Enroll 
WHERE, Student.sid = Enroll.sid 
ORDER BY(Gourse) name 
Y 


Student Enroll 


Se _— 
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Example 1 


Student 


SELEGT * 

FROM Student, Enroll 
Student.sid = Enroll.sid 
ame, course 


ORDER BY 


Student.sid name year Enroll.sid course 
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Example 2 


Student 


SELECT * 
FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 
ORDER BY name ASC), course (ESC) 


Student.sid name year Enroll.sid course 
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Example 3 


Student 
sid name year SELEGT Student.sid, course 


FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 


ORDER BY name, course 


955 Ahuva 2 


Enroll 


Student.sid course 
861 DB 
861 PL 
753 Al 
. 753 DC 
4753 PL 


sid course 
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Top-k Tuples 


SOL allows to limit the result to only the first k 
answers, Tor some number k of choice 


SELECT A,,...,A, 
FROM R,,...,R, 
WHERE Condition(B,,...,B,) 
ORDER BY C,,...,C, 
IMITAR) 
Example: 
SELECT * 


FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 
ORDER BY name 
LIMIT 8 


Example 


Student 


SELEGT * 
FROM Student, Enroll 
WHERE Student.sid = Enroll.sid 
ORDER BY name, course 
LIMIT 3 


Enroll 


sid course 


Student.sid name year Enroll.sid course 


861 Alma 2 
861 Alma 
753 Amir 


861 DB 
861 PL 
753 Al 
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Outline 


e Introduction 
e Basic SQL Queries 
> s Aggregation and Grouping 
e Nested SQL Queries 
e NULLS 
e Views 


Exercise (for the end of this part) 


Student Posting Likes 


name Faculty id student posting 


Alma CS 
i CS 


What is the average #likes per posting 


in each faculty? Show only for faculties with 
>2 liked postings 
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Scalar vs. Aggregate Functions 


e Scalar function: tuple to value 


— ROUND(v,i): round v to i decimals ES e J 

— UPPER(v): convert string to uppercase 

— y, v+w, v*w, ...: arithmetic \ ( gil NY 

— NOW(): current time = 


* Aggregate function: table to value (typically one 


column to value) 
— SUM(C) — sum the numbers in C 
— COUNT(C) — number of rows in C 
e COUNT(*) — number of rows, same as COUNT(C) 
— AVG(C) — SUM(C)/COUNT(*) 
MAX(C) — largest value ; MIN(C) — smallest value 
Scalar over aggregates, e.g., SUM(C)*SUM(D) 


Aggregates over scalar, e.g., SUM(C*D) 
A5 


Basic Aggregate Guery 


eee 
SELECT (AggiC(A1) » +++ MAGIA) 


FROM Rz7...,R, 

WHERE Condition(B,,...,B,) 
Apa An Bi BE 
attributes of R,,....R, 


Example: ( 
sexect(Suikeredit) 
FROM Enroll 


WHERE sid=861 


sid course credit grade 
— 46 


Semantics of a Basic Aggregate Query 


Possible: Agg; (DISTINCT A, ) 


SELECT Aggı(A,),... ,Agg,(Aş) 
FROM Ras « «<4 R 
WHERE Condition(B,,...,8,) 


Each A, Is an attribute of 


O Condition(B1,.:;Bm) (R, XX R.) 


representing the whole column of A; 
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, More General Aggregate Query 


Possible: Agg; (DISTINCT Fi(t)) 


SELECT Agg,(Fi(t)),...,Ag9:(Fi(t)) 
AA Ra ra 
WHERE Condition(B,,...,B.) 


where t is a tuple in 
O Condition(B:,...,Bım) (R, XX Ri) 
And each F; is a scalar function 
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UN v 
Example Ser Å 


Enroll sevse ME) as num, 


sid course credit EEE PAL (grade >) as max 
eI ae 

FROM Eo 

WHERE G 


LE 
Ea 
No - EG GEN 


3 


Mn 
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How Many Courses? 


CC 
Enroll SELECT COUNT( course) as num 
sid course credit grade FROM Enroll — 


| 58 


el de) 3 
2 


861 AL 72 
TT 


Y 


SELECT COUNT(distinct course) as num 
FROM EnroL L / 


3 
DB 
DB SELECT COUNT(distinct UPPER(course)) as num 
PL FROM Enroll 
PL 
PL 
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What Does This Compute? 


UnbalancedCoin scalar 


scalar 


SELECT @x prob))) 
FROM UnbalancedCoin 


d attribute 


PX NPD = emp) — | |p: 
L 


Legal Query? http://sqlfiddle.com 


create table 


S(L varchar, r int); 4 ERROR: column "s.l" must appear 
insert into S values in the GROUP BY clause or be 

e CA's Lys used in an aggregate function 

LB ez iyi b 50, 

e pp) Oracle: 

('D',2); attribute 


ORA-00937: not a 
På aggregate single-group 


group function 
SELECT lL, 


FROM S. MS SQL Server: 

WHERE r>1 Column 'S.l' is invalid 
in the select list 
because it is not 


[^ 


contained in either an 
aggregate function or the 
SQLite: GROUP BY clause. 


| sum(r) 
D 11 
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What can be the 
semantics here? 


Cannot just select 
“course” (scalar) 


Average Per Course 


Enroll SELECT BA 
FROM Enroll 
WHERE course='DB' 


| =< 


DB 75 
SELECT 'PL' AS course, AVG(grade) as avg 


sid course credit grade 


AVG(grade) as avg 


FROM Enroll 
WHERE course='PL' 


SY 


PL 76 


S 
Problem with this FROM Enroll 


ELECT 'AT' AS course, AVG(grade) as avg 
Solution? WHERE coursez'AT' 
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Grouping Idea i 


SELECT agg(B) 


SELECT agg(B) 
GROUP BY A 


m ay 


Group Selection 


SELECT agg(B) 
GROUP BY A 
HAVING COUNT(*)<3 


( Y Y 
| š 
P a 
i å 


S 


NY 


< 


F UT w N NI 
YY 


NY 


SELECT agg(B) 
GROUP BY A 


NY 


Grouping with Multiple Attributes i 


SELECT agg(B) 


SELECT agg(B) 
GROUP BY A,C 


> A ) 


Grouping witn Multiple Attributes 7 


SELECT agg(B) 
GROUP BY A,C 
HAVING A>=C 


SELECT agg(B) 
GROUP BY A,C 


> 0 ) 


Grouping Syntax Why both WHERE and HAVING? 


Columns obtained only from grouping 
attributes and aggregate functions 


Output 


SELECT NENNE 


FROM Ri SSR 
WHERE Condition(Bi,...,B- Base relation 
GROUP BY (G1, > Gi emae grouping attributes 
HAVING Cond Gi Gaaga AGG, Groups 


Columns obtained only from grouping 
attributes and aggregate functions 


such that the group satisfies COnd(G1 ,..., Gi ,..., AQUg+1 > agg) 
e Each evaluation gives one row 
e Grouping attributes can be treated as scalars 
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-xercise 1: What will be the result? 


create table SELECT L 
S(l varchar, r int); 

insert into S values FROM 5 
CA < 5 eC 1), WHERE r>1 


I “e2 İyi BS) 


('B 
('C',4),('D',Ø), GROUP BY I 
CD 


ww 


Exercise 2: What will be the result? 


create table SELECT L 
S(l varchar, r int); 
insert into S values FROM 5 
CA OL A Şİİ, WHERE r>1 
Buz) B 15), 
eg 0); GROUP BY l 
'D'! (2) 


HAVING count(r)>1 


Exercise 3: What will be the result? 


SELECT S.1 
FROM S,T 
WHERE S.12T.1 AND S.r>1 
GROUP BY S.1 
HAVING count(*)>1 


What is the result? 
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-xample 


Student Enroll Course 


sid name year sid course grade name credit 
861 Alma 2 
753 Amir 


DB 
PL 


699 Adi 
729 Avia 3 


Task: Find the average grade of 2"* year 
students with at least 5 credit points. 


name average =(79*3+82* 2)/5 


Alma 80.2 
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-xample 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit points. 


SELECT... 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name AND S.year=2 
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-xample 


Stüdent Enroll 


sid name year sid course grade name credit 


e average! grade of 2'i year 


students with at least 5 credit points. 


DA 


SELECT ... 


FROM Student S, Enroll T, Course C 


WHERE S sid=T. sid AND T.course = C.name AND deyir 
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-xample 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit points. 


SELECT... 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name AND S.year=2 
GROUP BY S.sid 
HAVING SUM(C.credit)>=5; 
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Example 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit poinis. 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name AND S.year=2 
GROUP BY S.sid 


HAVING SUM(C.credit)>=5; 
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-xample 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit points. 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name AND S.year=2 
GROUP BY S.sid, S.name 


HAVING SUM(C.credit)>=5; 
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Example (condition in HAVING) 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit points. 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = G.name 


GROUP BY S.sid, S.name 
HAVING S.year=2 AND SUM(C.credit)>=5; 
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Example (condition in HAVING) 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the average grade of 2"9 year 
students with at least 5 credit points. 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name 


GROUP BY S.sid, S.name, S.year 
HAVING S/yéar=2 AND SUM(C.credit)>=5; 
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create table Student(sid int, name varchar, year int); 
insert into Student values 

(861, 'Alma', 2), 

(753, ‘Amir’, 1), 

(955, 'Ahuva', 2), 

(699, 'Adi', 2); 


create table Enroll(sid int, course varchar, grade int); 
insert into Enroll values 

(861, 'DB', 79), 

(861, 'PL', 82), 

(753, 'PL', 93), 

(955, 'DB', 99), 

(955, 'Al', 72), 

(699, 'DB', 95); 


create table Course(name varchar, credit int); 
insert into Course values 


('DB', 3), Try online (w/ PosgreSQL): 
(PL, 2), * http://sqlfiddle.com 
(AL, 3); 


e https://saliteonline.com 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student -S- 
SU WHERE S.sid=T.sid AND T.course = C.name Base relation 
GROUP BY S.sid, S.name, S.year 
HAVING S.year=2 AND SUM(C.credit)>=5; 


S.sid S.name year  Tsid course grade C.name credit 


PE ANLE 


[iss | mme 2 [sss [fos | » fos [3 
loss | ana 2 | oss/{ ay m |a| 3 
ls) aa 2 \ e Lon) os | oo | > 


course grade C.name credit 


Student 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name 


— a 


EEE AN 
1d. | .name . year 


S.name year T.sid course grade C.name credit 


ss Jama [2 EE [3]. 
| [es fama [2 [os [aw [nl ale 


69 | adi | 2 | 699 | os | 95 | æ | 3 | g4 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 


WHERE S.sid=T.sid AND T.course = C.name 


GROUP BY S.sid, S.name, S.year 
HAVING S.year-2 AND SUM(G.credit)>=5; 


ir mi 


S.sid S.name year T.sid course grade C.name credit 


861 | Alma | 2 | 861 | œ | 79 | os |/3y 


D [| Cr fama | 2 [as a je] 


C g y 
(| 955 | Ahua | 2 | 955 | vB | 99 | vB VN] 
Qs ae a js aoe İma y 


EEES or 


GROUP BY S.sid, S.name, S.year 
HAVING S.year=2 AND SUMCC.credit)>-5; 


S.sid S.name year T.sid course grade C.name credit 


A [| ama | 2 | sr | os GI] os TGT o 
861 | Alma | 2 | æ | m | (6 | e | 


average 
401/5 480. 


name average 


Ahuva” 513/6 = 85.5 


SELECT S.name, 
sum(T.grade*C.credit)*1.0/sum(C.credit) as average 


FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course = C.name 
GROUP BY S.sid, S.name, S.year 
HAVING S.year=2 AND SUM(C.credit)>=5; 


-xercise 


Student Posting Likes 


name Oo Faculty id owner 


Alma CS 23 Alma 


student posting 


23 
76 


Ahuva EE 76 Ahuva 


79 Ahuva 


What is the average #likes per posting 


cas" 

Q Q v 
UTT annuun Nİ 

"Tyros ee, luene 


in each faculty? Show only for faculties with 


three or more liked postings 


76 


Solutlon 


Student mi Likes 


name Oo Faculty owner 


Ahuva EE 


What is the average #likes per PÅ 


in each faculty? Show only for faculties with 
three or more liked postings 


SELECT S.faculty, 
count(*)*1.0/count(DISTINCT L.posting) as average 


FROM mer Posting’, student 


WHERE læpostingyP.id AND P.owner=S name 
GROUP BY S.faculty Ei bi l L 
HAVING count(DISTINGT L.posting)>2; 


student posting 


CASE Selectors (outside our scope) 


Student SELECT Student.name, 
COUNT(CASE WHEN credit=2 
THEN Course.name END) AS c2, 
COUNT(CASE WHEN credit=3 
THEN Course.name END) AS c3 
FROM Course JOIN (Student JOIN Enroll 
ON (Student.sid = Enroll.sid)) 
ON (Enroll.course = Course.name) 
GROUP BY Student.sid, Student .name 


Enroll 


sid course grade 


9 
9 
6 


6 
6 
5 
5 
5 
9 


Course | name | 2 | cg 


course credit Adi 0 1 
Alma 1 1 


1 DB 79 
1 PL 82 
3 PL 93 
5 Al 72 
9 DB 95 
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Outline 


e Introduction 

e Basic SQL Queries ~< 

* Aggregation and Grouping 
> e Nested SQL Queries 

e NULLS 

e Views 


4 


Nesting 


e Nesting: one query is nested in another 
query as a relation/value component 


e The nested query IS called a subquery 


e Where are we nesting? 
— SELECT 
e Select a value defined as a subquery 
— FROM 
e Use a subquery instead of an existing relation 
— WHERE 
e Conditions phrased via subqueries 
e Most common, most important 
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Notation 


e Denote by: 
— Nxi: a query that returns a single column 


— 1x1: a query that returns a single column and at 
most one row 


— NxM: no restriction 


e Subqueries (nested queries) are sometimes 
required to be Nx1 or 1x1 


— In PostgreSQL, 1x1 is checked at runtime, hence, this 
property is sensitive to the database 
* As opposed to being Nx1 
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Subguery in WHERE 


e Most common place for subqueries 


e Several forms: 
— As any scalar value (1x1) 
* T.grade >= (SELECT MAX(grade) FROM Enroll) 


e Empty result is treated as NULL 
— (We discuss nulls later) 


- Membership testing (Nx1) 


* T.course IN (SELECT name FROM Course WHERE 
credit=2) 


— Nonemptiness testing (NxM) 


e EXISTS (SELECT * FROM Course WHERE credit=2) 
ke 
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Example 


Student Enroll i Course 
sid name year sid course grade 
, pez 
Task: Find the studer at] got the 


maximal grade 
SEGECT S.name 
FROM Student S, Enroll T 


WHERE S.sid = T.sid AND 
T.grade ze SELECT MAX(grade) from Enroll) 


1x1 
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Refined Example 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the students that got the 
maximal grade /n some course 


Student Enroll Course 


course grade course credit 


DB 


Scope of Form Variables 


e In a WHERE and SELECT subqueries, record 


names from the FROM clauses of enclosing 
queries are accessible as constants 


— In PL terminology, the subquery is within the 
scope of the super-guery's FROM variables 


* How does that help in our example? 
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Example Revisited i 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the students that got the 
Previous example: maximal grades 


SELEGT S.name 
FROM Student S, Enroll T 
WHERE S.sid = T.sid AND 
T.grade = (SELECT MAX( grade) FROM Enroll) 


Task: Find the students that got the 
maximal grade /n some course 


SELECT S.name 
FROM Student S, Enroll T 
WHERE S.sid = T.sid AND 
T.grade = (SELECT MAX( grade) FROM Enroll 
WHERE Enroll.course=7.course) 


Example of IN 


Student Enroll Course 


course grade name credit 


Task: Find the students who attended 
courses with >100 students 


sid name year sid 


SELEGT S.name 
FROM Student S, Enroll T 

WHERE S.sid=T.sid AND 
T.course IN (__________ 
SELECT course 
FROM Enroll ) 
¡GROUP BY course | 
I 


| HAVING COUNT(*)>100 
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Example with EXISTS 
Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the students who attended 
courses that Alma took 


SELECT S.name 

FROM Student S, Enroll T 
WHERE S.sid=T.sid AND 

EXISTS ( 

SR 
' O FROM Student, Enroll | 
| WHERE Student.sid-Enroll.sid AND 
I course=T.course and name='Alma')' 


Example with EXISTS 
Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the students who attended 
courses that Alma took 


SELEGT S.name 
FROM Student S, Enroll T 
WHERE S.sid=T.sid AND 
EXISTS ( 
(| SELECT E 


| 
ı FROM Student, EMEGE | 
| WHERE Student.sid | 
| course | 


Example of NOT IN 
Student Enroll Course 


sid name year sid course grade name o credit 


Task: Find the students who attended 
courses that Alma did not take 


SELEGT S.name 
FROM Student S, Enroll T 
WHERE S.sid=T.sid AND 
T.course NOT IN ( 
SELECT T1.course 
FROM Student S1, Enroll T1 
WHERE S1.sid=T1.sid AND 
S1.name='Alma' > 
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Example with NOT EXISTS 


sid name year sid course grade name credit 


Task: Find the students who attended 
courses that Alma did not take 


SELECT S.name 
FROM Student S, Enroll T 
WHERE S.sid=T.sid AND 
NOT EXISTS ( 
SELECT + 
FROM Student, Enroll 

WHERE Student.sid=Enroll.sid AND 

course=T.course and name='Alma') 
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Nesting Inside SELECT 


e You can have 1x1 subqueries in the SELECT 
clause 


e And as in WHERE, the subquery Is in the 
scope of the FROM variables 


Example of SELECT Nesting 


sid name year sid course grade name credit 


Task: For each student-course, list student 
name, course, and max grade in the course 


I 

SELEGT name, course, ! (SELECT MAX( grade) 
ı FROM Enroll 
| WHERE course=T.course)!AS MC 


FROM Student S, Enroll T 
WHERE S.sid=T.sid; 
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Example of SELECT Nesting 


sid name year sid course grade name credit 


Task: For each student-course, list student 
name, course, and max grade in the course 


o to 
SELECT name, course, ' (SELECT MAX 


FROM MN | 
, WHERE, PN 


FROM Student S, Enroll 
WHERE S.sid=T.sid; 


94 


Nesting Inside FROM 


e You can have NxM subqueries in the FROM clause 
e Such a query acts as an ordinary input relation 
e A subquery must be named (AS...) 


e Adjacent FROM variables are not in the 
scope of the subquery! 


— Recall that the FROM clause defines the iteration of 
value assignment for the variables... 


— Of course, you can use variables of nesting queries 
(treated as constants) 
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Example of FROM Nesting 


Student Enroll Course 


sid name year sid course grade name credit 


Task: Find the credits of courses with at 
least 100 students 


SELECT C.name, C.credit 


FROM Course C, (SELECT course 


FROM Enroll 
GROUP BY course 
HAVING COUNT(*)>100) AS b 


WHERE C.name = L.course 


(Could be done with simple grouping and HAVING...) 
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Outline 


e Introduction 
e Basic SQL Queries 
* Aggregation and Grouping 
e Nested SQL Queries 
Es NULLS 
e Views 


Missing Information 


e Problem: pieces of data missing, but we need 
to keep whatever partial knowledge we have 


Enroll Course 


student course course o lecturer 


e A source tells us that Alon Is a student of Keren 
— How can we represeni it in our database? 


Enroll Course 


student course course lecturer 


L=NULL 


SQLs NULL 


e NULL is SQLs special “missing value” 


e Same queries as complete tables, but SQL 
assigns a special behavior to logic over 
NULL 
— “Three-valued logic”: true, false, unknown 


e Alas, there are Some Issues... 


Try It Yourself (PostgreSQL) 


CREATE TABLE Enroll( 

student varchar(40), 

course varchar(40)); 
INSERT INTO Enroll VALUES 
("Ahuva', 'PL'), ('Alon' ,NULL); 


Enroll 


CREATE TABLE Course( 
course varchar(40), 
Lecturer varchar(40)); 


INSERT INTO Course VALUES 
C'PL','Eran'), (NULL, 'Keren'); 


Course 


student course course lecturer 


SELECT student, lecturer 
FROM Enroll R, Course C 
WHERE R.course = C.course; 


student lecturer 


Of course, we've lost our initial association (join)... 
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Try More Yourself (psal) 


Enroll Course 


student course course lecturer 


SELECT student SELECT student SELECT student 
FROM Enroll; FROM Enroll FROM Enroll 


WHERE course='PL'; WHERE course!='!PL'; 
student 


Ahuva student | student | 
(SELECT student SELECT student 
FROM Enroll FROM Enroll 
WHERE course='PL' ) WHERE course='PL' OR course!='PL' ; 
UNION student 
(SELECT student Ahuva 
FROM Enroll Ahua | 


WHERE course!='PL'); Alon?? 
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- Another Example (1) 


Orders 


Find the paid orders: 


SELECT * 
FROM Orders 
WHERE ord IN 
(SELECT ord FROM Paid) 


SELECT * 
FROM Orders 0 
WHERE EXISTS 

(SELECT ord FROM Paid J 


WHERE B.ord = ØQ.ord) 


Paid 
customer 
c11 


Result: 


ord 
011 


NZ 


price 
100 
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Another Example (2) 


Orders 


ord customer 


o11 c11 
022 c22 


Find the unpaid orders: 


SELECT ~ 
FROM Orders 
WHERE ord NOT IN 
(SELECT ord FROM Paid) 


SELECT * 

FROM Orders 0 

WHERE NOT EXISTS 
(SELECT ord FROM Paid Ñ 


WHERE B.ord = Q.ord) 


N Result: 
ord price 
/ 033 300 
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Another Example (3) 


Orders Paid 
ord customer 


o11 c11 
c22 


Find the unpaid orders: 
SELECT * Result: 
FROM Orders 

— ord 


WHERE ord NOT IN 
(SELECT ord FROM Paid) 


Result: 


SELEGT * 
FROM Orders 0 
WHERE NOT EXISTS 
(SELECT ord FROM Paid Ñ 


WHERE B.ord = 0.ord) 
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Semantics of Null 


Principle: atomic predicates (e.g., comparison 
between two numbers) result in unknown 
(denoted U) when one or more operands is NULL 
— 5>NULL, NULL=NULL, etc. 


e Then, propositional logic over atomic predicates 


follows the three-valued logic (8VL) 

— Next 

The tuples that survive the WHERE condition are 
those where the value Is true (not false and not 
unknown) 


Three-Valued Logic (3VL) 
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Avoiding Nulls 


CREATE TABLE Enroll( 
Sid int, 


course text NOT NULL 


SQL’s DDL (Data Definition Language): constrain on non-nullity 


SELECT sid (SELECT sid FROM Enroll) 
FROM Enroll EXCEPT 


WHERE course IS NOT NULL | | (SELECT sid FROM Enroll 
WHERE course IS NULL) 


Queries: nullity testing 
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Outer Joins 


Student Address 
SSN city 


861 Haifa 
302 Nazareth 


955 Isfiya 
852 Afek 


Task: Extend student with the address 
information (convenience, join avoidance) 


Student Address 
SSN name year SSN city SSN name year city 


861 Alma 2 861 Haifa 861 Alma 2 Haifa 
753 Amir 1 302 Nazareth | — 852 Ahuva 2 Afek 
52 2 


4 . 
852 Afek Weve lost Amir! 
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| eft Outer Join 


Student Address 
SSN city 


30 Nazareth 


61 
2 


Task: Extend student with the address 
information (convenience, join avoidance) 


Student Address 
SSN name year SSN city SSN name year city 


861 Alma 2 861 Haifa 861 Alma 2 Haifa 
753 — Amir 1 302 Nazareth | — 753 Amir 1 L 
52 2 852 Ahuva 2 Afek 


852 Afek 
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Definition 


e Let rand s be relations over schemas 
R(A,,...A,) and S(B,,....B,,), respectively 
e We define: 
RIS = (ROIS)U((R - RXS) X {(LL,..,.L)} T 


e That is, R&S contains: 
— All the tuples in the join of R and S 
— All the tuples of r that cannot be joined, padded 
with NULLs 


Right Outer Join 


Student Address 
SSN city 


861 Haifa 
302 Nazareth 
55 i 


SSN name year 


86 Alma 2 
1 


753 Amir 


|u 


L 852 Afek 
5 


aS 
SSN name year city 


ası) Ama 2 tata | 
asal am å Vak | 
| IN åren | 


05) Ni na J 
4 TE a, 


name year SSN city 
Å 861 / Haifa k 

302 7 Nazareth | 
| 955/ liya ) 
\\ 852/ _ Afek 


852/ A 
w 112 


Full Outer Join 


Student Address 
SSN city 
861 Haifa 


SSN name year 


861 2 
x 


fra = (TPASJU(TIYS)- 
T T 
Student Address 
SSN name year SSN city SSN name year city 


861 Alma 2 861 Alma 2 Haifa 
753 — Amir 1 30 Nazareth | — 852  Ahuva 2 Afek 
52 2 9 i 


302 Nazareth 
955 D Isfiye 
i 


61 
2 
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Left Outer Join in SOL 


Student Address 


sor | hamre. 


\ 357 


SELECT S.SSN, name, year, city 
FROM Student S/LEFT O JOIN) Address A 
ON (S.SSN = A.SSN) 


SSN name year city 


asi Ama 2 | ta 

EPA İT 

Casa Ahwa 27 AK 
| —— 


Right Outer Join in SOL 


Student Address 


SSN name year SSN city 


7 


61 2 861 i 


SELEGT A.SSN, name, year, city 
FROM Person S RIGHT OUTER JOIN Address A 
ON (S.SSN = A.SSN) 


SSN name year city 
861 Alma 2 Haifa 
Afek 


Nazareth 


Isfiya 


Full Outer Join in SQL 


Student Address 


SSN name year 


SSN city 


852 Ahuva 2 


955 Isfiya 
852 Afek 


SELECT * 
FROM Person S FULL OUTER JOIN Address A 
ON (S.SSN = A.SSN) 


SSSN A SSN name year city 


Alma Haifa 


L Nazareth 
L Isfiya 


What Null Means to People? 


e İt is important to realize that developers interpret 
nulls in two ways [Toussaint et al. 2022]: 
— Unknown: there is a value, we do not know what it is 
— Inapplicable: the value is missing since it does not exist 


e Various reasons for nulls; think of an app form 
— Å value is mistakenly omitted (unknown) 


— Å value Is deliberately omitted, e.g., the person does 
not wish to tell her age (unknown) 


— Å value is optional, e.g., middle name (n/a) 
— Å value Is logically irrelevant yet required by the form: 
martial status is “single” — spouse ID is “null” (n/a) 
e SVL is really designed for “unknown” 
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Try to Avoid Nulls via Schema Decomposition 


Student StudentName 


ord customer i customer 
c11 
c22 


e Boolean logic instead of 3VL 
e No magic! It just forces the user to handle missing information explicitly 
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Outline 


e Introduction 
e Basic SQL Queries 
* Aggregation and Grouping 
e Nested SQL Queries 
e NULLS 
> e Views 


Practical Problem Related to Example 


Student Enroll Course 


course grade course credit 


DB 


e Problem scenario 1 


— Almost every interesting question we have requires 
joining Student and Enroll 


— Complicates queries 
— Joint computation Is not shared 


— Nevertheless, we do not want to maintain data in the 
Join t f orm StudentCourse 


sid name year course grade 
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Another Problem 


Student Enroll Course 


course grade course credit 


DB 


e Problem scenario 2 


— Scenario: Alice asks for a project; I ask her to 
implement course suggestion for other students 


— | give her access: GRANT SELECT on Enroll to alice 


— | can live with her seeing who took what, but 
not the grades! 
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Student Enroll Course 


name year sid course grade course credit 


Alma 2 DB 
Amir 
Ahuva 


Definition: A view is a stored query of which result 
can be accessed as an ordinary relation 
MATERIALIZED 


CREATE VIEW STC as 
SELECT S.sid, T.course, C.credit 
FROM Student S, Enroll T, Course C 
WHERE S.sid=T.sid AND T.course=C.course 


SELECT sid, course FROM STC WHERE ... 


GRANT SELECT on STC to alice | 


Advantages of Views 


e İn queries, you can treat views as ordinary 
relations 


e Always updated, always correct with respect to its 
definition 
— No need to update the view once source relations are 
updated 


e Allows for simpler queries without introducing 
redundant dependencies 


e For a complicated view, the chance of a mistake is 
smaller than that of repeated subqueries 


View Management 


e Two types of views: 


— Materialized view: the view exists and constantly 
maintained by the system 


— Non-materialized view: the view is created as part 
of the query like a subquery (default in Postgres) 


Non-materialized Materialized 


Slower queries Faster queries 


No extra update overhead Slower updates 
No extra storage overhead Storage overhead 


e Incremental view maintenance IS an active and 
deep technological and research topic 


— Another fascinating topic: updating the database by 
updating the view (a.k.a. view updates) 
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